# Vision-Text Generation
Vit GPT2 Image Captioning
An image captioning model based on the ViT-GPT2 architecture, capable of generating natural language descriptions for input images.
Image-to-Text
Transformers

V
motheecreator
149
0
Vit GPT2 Image Captioning
An image captioning model based on the ViT-GPT2 architecture, capable of generating natural language descriptions for input images.
Image-to-Text
Transformers

V
mo-thecreator
17
0
Featured Recommended AI Models